Chaining Multiple-Alignment Fragments in Sub-Quadratic Time

Authors

  • Gene Myers
  • Webb Miller
Abstract

We describe a multiple-sequence alignment algorithm for determining the highest-scoring alignment that can be obtained by chaining together non-overlapping subalignments selected from a given collection of such "fragments". For a given set of $K$ sequences, a problem instance consists of a set of $F$ precomputed fragments, an alignment score for each fragment, and a "gap" penalty function that assigns a cost for chaining two fragments together. When interpreted as a maximum weight path problem in a directed acyclic graph, it is computable in $O(F^2)$ time. Here we instead interpret the problem as "$K$-dimensional sparse dynamic programming" and take advantage of its underlying geometric nature. Assuming $K < \log F$, our algorithm runs in time $O(F \log^K F)$ and space $O(KF \log^{K-1} F)$, making it the first sub-quadratic sparse dynamic programming algorithm for the case $K > 2$.
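The quadratic baseline mentioned in the abstract (a maximum weight path in a directed acyclic graph) can be sketched as follows. This is a minimal illustration, not the authors' sub-quadratic algorithm. The `Fragment` layout, the strict "ends before it starts in every sequence" ordering in `precedes`, and the `l1_gap` penalty are all assumptions made for the example; the abstract does not specify them.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional, Sequence, Tuple


@dataclass(frozen=True)
class Fragment:
    # Hypothetical layout: one inclusive [start, end] interval per sequence.
    start: Tuple[int, ...]
    end: Tuple[int, ...]
    score: float


def precedes(a: Fragment, b: Fragment) -> bool:
    """Assumed chaining rule: a ends strictly before b starts in every sequence."""
    return all(e < s for e, s in zip(a.end, b.start))


def l1_gap(a: Fragment, b: Fragment) -> float:
    """Assumed gap penalty: number of unaligned positions between a and b."""
    return float(sum(s - e - 1 for e, s in zip(a.end, b.start)))


def best_chain(
    fragments: Sequence[Fragment],
    gap: Callable[[Fragment, Fragment], float] = l1_gap,
) -> Tuple[float, List[int]]:
    """Return (best score, fragment indices) using the O(F^2) DAG formulation."""
    if not fragments:
        return 0.0, []
    # If a precedes b then a.start[0] <= a.end[0] < b.start[0], so sorting by
    # the first start coordinate yields a topological order of the DAG.
    order = sorted(range(len(fragments)), key=lambda i: fragments[i].start[0])
    best = [0.0] * len(fragments)
    back: List[Optional[int]] = [None] * len(fragments)
    for pos, j in enumerate(order):
        best[j] = fragments[j].score  # chain consisting of fragment j alone
        for i in order[:pos]:  # every earlier fragment is a candidate predecessor
            if precedes(fragments[i], fragments[j]):
                cand = best[i] - gap(fragments[i], fragments[j]) + fragments[j].score
                if cand > best[j]:
                    best[j], back[j] = cand, i
    tail = max(range(len(fragments)), key=best.__getitem__)
    chain: List[int] = []
    k: Optional[int] = tail
    while k is not None:  # follow back-pointers to recover the chain
        chain.append(k)
        k = back[k]
    chain.reverse()
    return best[tail], chain


example = [
    Fragment((0, 0), (4, 4), 5.0),
    Fragment((6, 7), (9, 10), 4.0),
    Fragment((3, 12), (8, 15), 3.0),
    Fragment((11, 11), (14, 14), 6.0),
]
result = best_chain(example)
```

The inner loop inspects every earlier fragment, which is where the quadratic cost comes from; the sparse dynamic programming view described in the abstract replaces that scan with geometric range queries.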

Similar resources

A Local Chaining Algorithm and Its Applications in Comparative Genomics

Given fragments from multiple genomes, we will show how to find an optimal local chain of colinear non-overlapping fragments in sub-quadratic time, using methods from computational geometry. A variant of the algorithm finds all significant local chains of colinear nonoverlapping fragments. The local chaining algorithm can be used in a variety of problems in comparative genomics: The identificat...

Matt: Local Flexibility Aids Protein Multiple Structure Alignment

Even when there is agreement on what measure a protein multiple structure alignment should be optimizing, finding the optimal alignment is computationally prohibitive. One approach used by many previous methods is aligned fragment pair chaining, where short structural fragments from all the proteins are aligned against each other optimally, and the final alignment chains these together in geome...

A simple and space-efficient fragment-chaining algorithm for alignment of DNA and protein sequences

In the segment-based approach to sequence alignment, nucleic acid and protein sequence alignments are constructed from fragments, i.e., from pairs of ungapped segments of the input sequences. Given a set $F$ of candidate fragments and a weighting function $w : F \to \mathbb{R}_0^+$, the score of an alignment is defined as the sum of weights of the fragments it consists of, and the optimization problem is to fi...

Novel Definition and Algorithm for Chaining Fragments with Proportional Overlaps

Chaining fragments is a crucial step in genome alignment. Existing chaining algorithms compute a maximum weighted chain with no overlaps allowed between adjacent fragments. In practice, using local alignments as fragments, instead of Maximal Exact Matches (MEMs), generates frequent overlaps between fragments, due to combinatorial reasons and biological factors, i.e., variable tandem repeat stru...

Multiple Genome Alignment: Chaining Algorithms Revisited

Given $n$ fragments from $k > 2$ genomes, we will show how to find an optimal chain of colinear non-overlapping fragments in time $O(n \log^{k-2} n \log\log n)$ and space $O(n \log^{k-2} n)$. Our result solves an open problem posed by Myers and Miller because it reduces the time complexity of their algorithm by a factor $\log^2 n / \log\log n$ and the space complexity by a factor $\log n$. For $k = 2$ genomes, our algorithm t...
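As a quick check of the stated improvement factors, the Myers and Miller bounds from the abstract at the top of this page can be divided by the bounds quoted here, writing $n$ for $F$ and $k$ for $K$ and treating $k$ as a constant in the space bound (that last simplification is an assumption of this check):

\[
\frac{n \log^{k} n}{n \log^{k-2} n \, \log\log n} = \frac{\log^{2} n}{\log\log n},
\qquad
\frac{n \log^{k-1} n}{n \log^{k-2} n} = \log n .
\]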

Publication date: 1995